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Ongoing searches for a "quantum theory of gravity" have repeatedly led to the sug- 
gestion that space-time might ultimately be anisotropic (Finsler-like) and/or exhibit 
"multi-refringence" (multiple "signal cones"). While multiple (and even anisotropic) sig- 
nal cones can easily be dealt with in a unified manner, by writing down a single "Fresnel 
equation" to simultaneously encode all signal cones in an even-handed manner, once one 
moves "off the signal cone" and attempts to construct a full multi-refringent "spacetime 
metric" we shall see that the situation becomes considerably more problematic. (In con- 
trast, in general relativity, the single [mono-refringent] signal cone, together with a single 
scalar degree of freedom, the conformal factor, uniquely characterizes the full Lorentzian 
spacetime metric.) In fact in the multi-refringent case we shall report a significant neg- 
ative result, a "no-go" argument: Despite the fact that multiple signal cones can easily 
be unified in a Fresnel formalism, multi-refringent spacetime metrics strongly resist any 
attempt at unification — exhibiting serious and wide-spread pathologies that appear 
intrinsic, comprehensive, and unavoidable. 

To throw some light on this situation we shall work by analogy — we shall ex- 
plore the one physical situation where we have direct experimental evidence that both 
of these phenomena (anisotropy and multi-refingence) occur simultaneously: bi- axial bi- 
refringent crystal optics. We shall first develop a spacetime interpretation of the standard 
purely spatial Finsler structures, leading to the notion of pseudo-Finsler spacetimes. We 
shall then study the extent to which we can "unify" the spacetime treatment of multi- 
ple polarization modes by adopting a Lorentzian-signature generalization of Ricmann's 
prescription, and attempting to encode everything in one overarching pseudo-Finsler 
spacetime metric. We find that the natural "unified" quantity, while it defines a per- 
fectly good pseudo-Finsler norm, leads to a seriously diseased pseudo-Finsler metric. 
From the pseudo-Finsler norm we can easily set up a Fresnel equation, but the attempt 
to go "off-shell" to construct a full spacetime pseudo-Finsler geometry is doomed. 

The significance of this result extends beyond the optical framework in which (purely 
for pedagogical and illustrative reasons) we are working, and has implications for a wide 
class of models that attempt to introduce multi-refringence and Finsler-like intrinsic 
anisotropics to the low-energy manifold-like limit of "quantum gravity" . Our "no-go" 
result indicates that in multi-refringent models there is no simple or compelling way 
to construct any unifying notion of pseudo-Finsler spacetime metric. At the level of 
the metric (though not at the level of the signal cones) one has to resort to multiple 
independent pseudo-Finslerian metrics, often with troublesome "cusps" on the individual 
signal cones. This strongly suggests that, despite its initial popularity, multi-refringence 
is not a useful route to explore. 

Keywords: Finsler spacetime; Finsler norm; Finsler metric; Finsler space; bi-axial bi- 
rcfringent crystal optics. 
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1. Introduction 

The theoretical physics community has recently exhibited increa sing interest in the 
possibility of ultra- high-energy violations of Lorentz invariance JIHim and in the pos- 
sibility of quantum-gravity-induced spacetime anisotropics and multi-rcfringence. 
Specifically — recent speculations regarding Lorentz symmetry breaking and/or 
fundamental anisotropics and/or multi-refringence arise separately in the many 
and various approaches to quantum gravity. Such phenomena arise in loop quan- 
tum gravity,^ string modelsp! and causal dynamical triangulationsj^ and are also 
part and parcel of the "analogue spacetime" programme,^ and of many attempts 
at developing "emergent gravity" Recently, the ultra-high energy breaking of 
Lorentz invariance has been central to the Horava-Lifshitz models!^"^ Of course 
not all models of quantum gravity lead to high-energy Lorentz symmetry break- 
ing, and the comments below should be viewed as exploring one particular class of 
interesting models. 

If we wish to develop a geometric spacetime framework for representing Lorentz 
symmetry breaking, either due to spacetime anisotropics or multi-refringence, then 
it certainly cannot be standard pseudo-Riemannian geometry. This strongly sug- 
gests that carefully thought out extensions and modifications of pseudo-Riemannian 
geometry might be of real interest to both the general relativity and high-energy 
communities. In particular, when attempting to generalize pseudo-Riemannian ge- 
ometry the interplay between the "signal cones" of a multi-refringent theory and 
the generalized spacetime geometry is an issue of considerable interest: 

• In standard general relativity the (single, unique) signal cone almost com- 
pletely specifies the spacetime geometry — one need only supplement the 
signal cone structure with one extra degree of freedom at each point in 
spacetime, an overall conformal factor, in order to completely specify the 
spacetime metric, and thereby completely specify the geometry. (This is 
ultimately due to the fact that in standard pseudo-Riemannian geometry 
the scalar product is a simple bilinear operation.) 

• In multi-refringent situations it is quite easy to unify all the signal cones in 
one single Fresnel equation that simultaneously describes all polarization 
modes on an equal footing. 

• In a standard manifold setting, where we retain the usual commutative co- 
ordinates, we shall see that it is natural to demand that each polarization 
mode can be assigned an independent pseudo-Finsler metric. Earlier work 
on Finsler-like applications to phy sics includes that of Takanjm, Asanov-^, 
VacartPHHI anc [ Q^gj-gfUHUl More recent work has appeared in refer- 
ences QjjHSTl and other articles cited below. 

• What is exceedingly difficult, and we shall argue is in fact outright impos- 
sible within this framework, is to construct a unified formalism that moves 
"off-shell" (off the signal cones). 

• This is a negative result, a "no-go theorem", which we hope will focus 
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attention on what can and cannot be accomplished in any natural way 
when dealing with multi-refringent anisotropic Finsler-like extensions to 
general relativistic spacetime. 
• If one steps outside of the usual manifold picture, either by adopting non- 
commutative coordinates, or even more abstract choices such as spin foams, 
causal dynamical triangulations, or string-inspired models, then the issues 
addressed in this article are moot — our considerations are relevant only 
insofar as one is interested in the first nontrivial deviations from exact low- 
energy Lorentz invariance, and only relevant insofar as these first nontrivial 
effects can be placed in a Finsler-like setting. 

Considerable insight into such Finsler-like models can be provided by considering 
the "analogue spacetime" programme, where analogue models of curved spacetime 
emerge at some level from well unders too d physical systems.^ In particular, the 
physics of bi-axial bi-refringent crystal^^ provides a particularly simple physical 
analogue model fo r th e mathematical object introduced some 155 years ago by 
Bernhard Riemann^- and now known as a Finsler distance. (See |Appendix A] for 
some general mathematical background regarding Finsler spaces.) 

We shall soon see that this mathematical object can reasonably easily be ex- 
tended to a Lorentzian-signature pseudo-Finsler space-time, with an appropriate 
pseudo-Finsler norm, though defining a pseudo-Finsler metric is more subtle and 
fraught with technical problems. (It must be emphasized that, despite many misap- 
prehensions to the contrary, uni-axial birefringent crystals are relatively uninterest- 
ing in this regard; they do not lead to Finsler 3-spaces, but "merely" yield bi-metric 
Riemannian 3-geometries.) 

While the use of Finsler 3-spaces to describe crystal optics is reasonably common 
knowledge within the community of mathematicians and physicists studying Finsler 
spaces, it is very difficult to get a clear and concise explanation of exactly what is 
going on when one generalizes to Lorentzian signature space-time. In particular the 
fact that any relativistic formulation of Finsler space needs to work in Lorentzian 
signature ( — h ++), instead of the Euclidean signature (+ + ++) more typically 
used by the mathematical community, leads to many technical subtleties (and can 
sometimes completely invalidate naive conclusions). 

Note that for the purposes of the current article we are not particularly interested 
in the properties of birefringent crystals per se, we are really only interested in them 
as an exemplar of Finsler 3-space and Finsler space-time, as a guidepost to more 
complicated things that may happen in Finslerian extensions to general relativity. 
We shall already see a number of interesting pitfalls arising at this (in principle) 
quite elementary level. 

Our ultimate goal is to be able to say something about the (presumed) low- 
energy manifold-like limit of whatever quantum theory, or class of quantum the- 
ories, approximately reproduces Einstein gravity. To this end, our "no-go" result 
indicates that the popular assumption that anisotropies and multi-refingence are 
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likely to occur in "quantum gravity" leads to significant difficulties for the Einstein 
equivalence principle — since even the loosest interpretation of the Einstein equiv- 
alence principle would imply the necessity of a coherent formalism for dealing with 
all signal comes, and the spacetime geometry, in some unified manner. 

We conclude that, despite the fact that spacetime anisotropics and multi- 
rcfringence are very popularly assumed to be natural features of "quantum gravity" , 
and while these features have a straightforward "on-shell" implementation in terms 
of a suitably defined Fresnel equation, there is no natural way of extending them 
"off-shell" and embedding them into a single over-arching spacetime geometry. 

2. Outline 

Purely for the purposes of developing a useful analogy, which we shall use as a 
guide to the mathematics we wis h to develop, we will focus on the optical physics of 
bi-axial bi-refringent crystalsW^ After very briefly presenting the basic definitions, 
we show how various purely spatial 3-space Finsler structures arise. (Many purely 
technical details, when not directly involved in the logic flow, will be relegated to 
the appendices.) We again emphasize that uni-axial bi-refringent crystals, which are 
what much of the technical literature and textbook presentations typically focus on, 
are for our purposes rather uninteresting uni-axial bi-refringent crystals "merely" 
lead to bi-metric Riemannian space-times and are from a Finslerian perspective 
"trivial" . 

We shall soon see that even in three-dimensional space there are at least four 
logically distinct Finsler structures of interest: On the tangent space each of the 
two photon polarizations leads, via study of the group velocity, to two quite distinct 
Finsler spacetimes. On the co-tangent space each of the two photon polarizations 
leads, via study of the phase velocity, to two quite distinct co-Finsler spacetimes. 
The inter-relations between these four structures is considerably more subtle than 
one might naively expect. 

Additionally, (apart from some purely technical difficulties along the optical axes 
in bi-axial crystals), each of these four 3-dimensional spatial Finsler structures has 
a natural 4-dimensional extension to a spacetime pseudo-Finsler structure. Beyond 
that, there are reasonably natural ways of merging the two photon polarizations into 
"unified" Finsler and co-Finsler norms, closely related to the appropriate Fresnel 
equation, though the associated Finsler metrics are considerably more problematic 
- all these mathematical constructions do come with a price — and we shall be 
careful to point out exactly where the the technical difficulties lie. 

Finally, using this well-understood physical system as a template, we shall (in 
the spirit of analogue spacetime programme^) then ask what this might tell us 
about possible Finslerian extensions to general relativity, an d in particular to the 

loniioi I 

subtle relationship between bi-refringence and bi-metricityj 11 1 (or more generally, 
multi-refringence and multi-metricity). 

Specifically, we have investigated the possibility of whether it is possible to use- 
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fully and cleanly deal with both Finsler structure (anisotropy) and multi-refringence 
simultaneously. That is, given two (or more) "signal cones" : Is it possible to natu- 
rally and intuitively construct a "unified" pseudo-Finsler spacetime such that the 
pseudo-Finsler metric specifies null vectors on these "signal cones" , but has no other 
zeros or singularities! Our results are much less encouraging than we had originally 
hoped, for and lead to a "no-go" result. Our analysis suggests suggest that while 
pseudo-Finsler spacetimes are certainly useful constructs, they do not appear suit- 
able for dealing with multi-refringence — when trying to analyze the full spacetime 
geometry (instead of merely restricting attention to the signal cones) it seems phys- 
ically more appropriate to think of physics as taking place in a single topological 
manifold that carries several distinct pseudo-Finsler metrics, one for each polariza- 
tion mode. 



3. Finsler basics 

Mathematically, a Finsler function (Finsler norm, Finsler distance functi on) is d e- 
fined as a function F(x, v) on the tangent bundle to a manifold such thatp2n43J 

F(x,kv) = n F(x,v). (1) 

This then allows one to define a notion of distance on the manifold, in the sense 
that 

S(x(t i ),x(t f ))= f F (x(t),^-) dt (2) 



u 



is now guaranteed to be independent of the specific parameterization t. In the case 
of a (pseudo-)Riemannian manifold with metric g a b{x) one would take 



F{x, v) = ^gab(x) v a v\ (3) 

but a general (pseudo-)Finslerian manifold the function F(x, v) is arbitrary except 
for the linearity constraint in v. Note that in Euclidean signature, the general Finsler 
function F{x, v) is typically smooth except at v = 0. In Lorentzian signature how- 
ever, F{x, v) is typically non-smooth for all null vectors — so that non-smoothness 
issues have grown to affect (and infect) the entire null cone (signal cone) . As we shall 
subsequently see below, sometimes a suitable higher algebraic power, F 2n {x,v), of 
the pseudo-Finsler norm is smooth. 

In significant contrast to the above, to ensure smoothness of the Finsler metric, 
defined below, it is advisable (even in the Euclidean-signature case, let alone in 
Lorentzian signature) to go beyond the smoothness of F(x,v) and to demand the 
considerably stronger condition that the square F (x, v) be smooth, except possibly 
at v = 0. It is standard to define the (pseudo-)Finsler metric as 

. 1 d 2 [F 2 (x,v)\ 
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which then satisfies the constraint 



g ab (x,Kv) = g ab (x,v). (5) 



This can be viewed as a "direction-dependent metric" , and is clearly a significant 
generalization of the usual (pseudo-)Riemannian case. Almost all of the relevant 
mathematical literature has been developed for the Euclidean signature case (where 
g a b{x, v) is taken to be a positive definite matrix). Because of this assumption, any 
mathematical result that depends critically on the assumed positive definite nature 
of the matrix of metric coefficients cannot be carried over into the physically inter- 
esting pseudo-Finsler regime, at least not without an independent proof that avoids 
the positive definite assumption. (Unfortunately it is not uncommon to find signif- 
icant mathematical errors in the pseudo-Finsler physics literature due to neglect of 
this elementary point.) Basic references within the mathematical literature include 



There have been several recent attempts at using Finsler geometry in var- 
ious aspects of current physics research. For example, one use is related to 
anisotropic geometriesp^ another is related to modifications of general relativity 
and cosmology!^ In more fundamental work, Finsler geometries have been di rectly 
related to Lorentz symmetry breaking in so-called "very special relativity" j 46 | 47 | 
More recently, it has been noted that the geometry of the spatial slices of arbitrary 
stationary space-times can cleanly be interpreted in terms of the particular class of 
Finsler geometries known as Randers spaces!^ (One might also try to generalize 
the spacetime in a slightly different direction by adopting the notion of an "area 
metric" but we shall not take that particular route for now.) 

An application more directly related to our specific topic of interest was made by 
Duvalp^ where he uses 3-dimensional purely spatial Finsler geometries (effectively 
choosing a preferred inertial frame). The use of Finsler geometry for birefringent 
crystal optics was also analysed from a more mathematical point of view in reference 
1411 but this is again restricted to 3-dimensional, preferred-frame, purely spatial 
Finsler geometry. In contrast, we are for current purposes interested in building up 
a full (3+1) -dimensional pseudo-Finsler space-time geometry based on birefringent 
crystals, as at a minimum such a space-time formulation will certainly be necessary 
for developing the possible extensions of general relativity we are interested in. 



4. Space versus space-time: Interpretations of the Finsler structure 

The key physics point in bi-axial bi-refringent crystal optics is that the group ve- 
locities, and the phase velocities, are both anisotropic, and depend on direction in 
a rather complicated way. Technical details that would detract from the flow of the 
article are relegated to |Appcndix B| 
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4.1. From group velocity to Finsler structure 

We can summarize the situation by pointing out that the group velocity is given by 



2/ \ _ g2(n,n) ± v / q 2 (n,n) 2 - g (n,n) (n^n) 
9 go(n,n) 

where 92 (n, n) and qo(n,n) are known quadratic functions of the direction n. The 
coefficients in these quadratic forms are explicit functions of the components of the 
3x3 permittivity tensor. See equations (|B.9j) and (|B.10[) . The function v g (n) so 
defined is homogeneous of degree zero in the components of n: 

« 9 O n ) = v g (n) = v g (h). (7) 

The homogeneous degree zero property should remind one of the relevant feature 
exhibited by the Finsler metric. It is also useful to calculate the so-called "group 
slowness" : 

1 _ 92 (n, n) =p \Jq_2 (n, n) 2 - q (n, n) (n jn) 
v g (n? (n-n) ' U 

Let us now first define the quantities 



n 



-P3±(n) = — = V g 2 (n,n) =F vW n , n ) 2 - g (n,n) (n • n), (9) 
or adopt the perhaps more transparent notation 



I Id: 



^3 ± (dx) = = ^/«,(dx,dx) T ^(dx, dx) 2 - g (dx, dx) (dx • dx). (10) 

This is by inspection a 3-dimensional (Riemannian) Finsler distance defined on 
space, having all the correct homogeneity properties, -F3±(Kdx) — KFs±(dx). Phys- 
ically, the Finsler distance is in this situation the time taken for the wavepacket to 
travel a distance dx. 

One sometimes encounters comments in the literature to the effect that this 
specific Finsler distance is an example of a Kropina metric. Such comments are 
unfortunately false. If we define a = y/ gij dx 1 dx^ , and j3 — b{ dx 1 , then Kropina 
metrics are of the form F(dx) = a 2 //?, and the more general class of so-called (a, /3) 
metrics is of the form _F(dx) = /(a, ft). By inspection, the Finsler geometry arising 
in bi-axial optics is neither a Kropina metric nor even an (a, ft) metric. 

In contrast, the Finsler 3- norm i*3± defined above is certainly compatible with 
equation (3.14) of reference [50] To see this, it is a brief calculation to verify that 

||§i x dx|| ||e 2 x dx|| <H- ^/g 2 (dx, dx) 2 — g (dx. dx) (dx-dx), (11) 

where ei^ are the ray optical axes as defined in |Appcndix C| See also equation (|43|) 
below. Unfortunately in the bi-axial case equation (3.13) of reference [50 is not 
correct, unless you reinterpret the denominator in that equation not (as claimed) 
as a Riemannian metric, but instead as a Finsler metric in its own right. Nor in 
the uni-axial case is equation (3.13) of reference 1501 correct: the correct statement 
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is that in the uni-axial case both ordinary and extraordinary rays are governed by 
(distinct) but quite ordinary Riemannian geometries. 

To now extend the construction given above to full (3+1) dimensional spacetime, 
we first define a generic 4- vector 



dX = (di;dx), 

and then formally construct 

F 4± (dX) = v/-(di) 2 + F 3± (dx) 2 . 

That is 



F 4± (dX) = J-(dt) 2 



dx • dx 
w g (dx) 2 ' 



Even more explicitly, one may write 



F 4 ±(dX) 



(dtf + g 2 (dx, dx) =F V<72(dx, dx) 2 - q (dx, dx) (dx • dx) 



1/2 



The null cones (signal cones) of F 4 ±(dX) are defined by 

F 4± (dX) = & ||dx|| = u s (dx) dt. 



(12) 
(13) 

(14) 

(15) 
(16) 



So far this has given us a very natural pair of (3+l)-dimensional pseudo-Finsler 
structures in terms of the ray velocities corresponding to the two photon polariza- 
tions. 



For future use, let us now formally define the quantity 

d.s 4 = {F 4 (dX)} 4 (17) 
= {F 4+ (dX) F 4 _(dX)} 2 

= (dt) 4 - 2{dt) 2 q 2 (dx, dx) + q (dx, dx) (dx • dx). 

This is tantalizingly close to the quartic extension of the notion of distance that 
Bernhard Riemann speculated about in his inaugural lecturePSl See |Appendix A[ 
and especially equation (|A.1|) . This is certainly an example of a specific and simple 
4 t,l -root pseudo-Finsler norm that can naturally and symmetrically be constructed 
from the two polarization modes, and its properties (and defects) are certainly worth 
investigationg. Physically the condition ds — defines a double-sheeted conoid (a 
double-sheeted topological cone) that is the union of the propagation cone of the 
individual photon polarizations. However we shall soon see that even though this 
construction is formally very close to Riemann's original suggestion, when it comes 
to defining a Finsler spacetime metric this construction nevertheless leads to a 
number of severe technical difficulties; difficulties that can be tracked back to the 
fact that we are working in Lorentzian signature. 
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4.2. From phase velocity to co-Finsler structure 

In counterpoint, as a function of wave- vector the phase velocity is 

q 2 (k, k) ± Vg 2 (k,k)2- (Zo (k,k) (k-k) 



(k-k) 



(18) 



where the quadratics g^(k, k) and <7o(k,k) are now given by equations (|B.20[) and 
(|B.21[) . This expression is homogeneous of order zero in k, so that 



v p (nk) = v p (k) = v p (k). 



(19) 



Again, we begin to see a hint of Finsler structure emerging. Because k is a wave- 
vector it transforms in the same way as the gradient of the phase; thus k is most 
naturally thought of as living in the 3-dimensional space of co-tangents to physical 
3-space. Let us now define a co-Finsler structure on that co-tangent space by 



G 3± (k) =«p(k) ||k| 



(k) ± v /g 2 (k,k)2- go (k,k) (k-k). 



<12 



(20) 



We use the symbol G rather than F to emphasize that this is a co-Finsler structure, 
and note that this object satisfies the required homogeneity property 



G 3 ±(«k) = K G 3± (k). 



(21) 



Now let us go for a (3+1) dimensional spacetime interpretation: Consider the 4-co- 
vector 



k = (w; k) , 



and define 
That is 

More explicitly 
G i± (k) 



G 4 ±(fc) = v / -^ 2 + G 3 ±(k) 2 . 
G 4± (fc) = J-u 2 + v p (k) 2 (k-k). 



1/2 



- oj 2 + q 2 (k, k) ± Vg 2 (k,k) 2 -g (k,k) (k-k) 
We again see that this object satisfies the required homogeneity property 

Gi±(nk) — KGi±(k), 



(22) 
(23) 
(24) 

(25) 
(26) 



so that this object is indeed suitable for interpretation as a co-Finsler structure. 
Furthermore the null co-vectors of G4 are defined by 



Gi±(k) = <^> lj = ap(k) ||k||, 



(27) 



which is exactly the notion of dispersion relation for allowed "on mass shell" wave- 
4-vectors that we are trying to capture. Thus G 4 ± lives naturally on the co-tangent 
space to physical spacetime, and we can interpret it as a pseudo-co-Finsler structure. 
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We can again define a "unified" quantity 



G 4 (fc) 4 = {G 4+ (fc) G 4 -(fc)} 2 

= - 2w 2 g 2 (k, k) + g (k, k) (k • k) . (28) 



Physically, the condition G 4 (fc) = simultaneously encodes both dispersion rela- 
tions for the two photon polarizations. Furthermore, we see that G 4 (fc) is tanta- 
lizingly close to a co-tangent space version of the quartic expression considered by 
Bernhard Riemann!^] Again, see Appendix A and especially equation (|A. 1|) . Phys- 
ically the condition G 4 (fc) = defines a double-sheeted conoid (a double-sheeted 
topological cone) that is the union of the dispersion relations of the individual pho- 
ton polarizations. The vanishing of G 4 (fc) can be viewed as a Fresnel equation, and 
can indeed be directly related to Fresnel's condition for the propagation of a mode 
of 4-wavenumber k — (oj; k). As is the case for F^dX), we shall soon see that even 
though this construction for G 4 (fc) is formally very close to Riemann's original sug- 
gestion, this construction (once one tries to extract a spacetime co-Finsler metric) 
nevertheless leads to a number of severe technical difficulties; difficulties that can 
again be tracked back to the fact that we are now working in Lorentzian signature. 



5. Technical issues and problems 

The situation as presented so far looks very pleasant and completely under control 
— and if what we had seen so far were all there was to the matter, then the study 
of pseudo-Finsler space-times would be very straightforward indeed — but now let 
us indicate where potential problems are hiding. 

• Note that up to this stage we have not established any direct connection 
between the Finsler functions F±(n) and the co-Finsler functions G±(k). 
Physically it is clear that they must be very closely related, but (as we shall 
soon see) establishing the precise connection is tricky. 

• Furthermore, the transition from Finsler distance to Finsler metric requires 
at least two derivatives. Even in Riemannian signature this places some 
smoothness constraints on the Finsler distance, smoothness constraints that 
are nontrivial and not always satisfied. (In Riemannian signature potential 
problems are confined to the immediate vicinity of the zero vector.) 

• There are also additional problematic technical issues involving the 4- 
dimensional spacetime pseudo-Finsler and pseudo-co-Finsler metrics - 
again certain components of the metric are infinite, but this time the po- 
tential pathology is more widespread. (In Lorentzian signature potential 
problems tend to infect the entire null cone.) 
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5.1. Defining Finsler and co-Finsler 3-metrics 

The standard definition used to generate a Finsler metric from a Finsler distance 
is to set: 

which in this particular case implies 



, x = 1 d 2 [q 2 (n, n) =j= v / g 2 {n, n) 2 - g (n, n) (n ■ n)] 

It is convenient to rewrite the quadratics as 

g 2 (n,n) = [g 2 ]ij (31) 

5o(n,n) = [g ]ij r^n- 7 ; (32) 

since then we see 

gij(n) = [q 2 ]ij T (discriminant contributions). (33) 

Unfortunately we shall soon see that the contributions coming from the discriminant 
are both messy and in certain directions, ill-defined. This is obvious from the fact 
that squares of both Finsler functions are not even everywhere differentiable. 
Similarly we can construct a Finsler co-metric: 

which specializes to 



, ii (k) = 1 d 2 [q 2 (k,k) T Vg2(k,k) 2 - g (k,k) (kvk)] 

It is again convenient to rewrite the quadratics as 

ga(k,k) = [q 2 ] ij hk 3 - (36) 

«o(k,k) = [gb]« hk 3 - (37) 

since then we see 

h^Ck) = [qi] 13 T (discriminant contributions). (38) 

Again we shall soon see that the contributions coming from the discriminant arc, 
in certain directions, problematic. 
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5.2. Technical problems with the Finsler 3-metric 

Consider the (ray) discriminant 

D = q 2 (n, n) 2 - g (n, n) (n ■ n). (39) 

There are three cases of immediate (mathematical) interest: 

Isotropic: If v x = v y = v z then D = 0; in this case the two Finsler functions F± 
are equal to ech other. F3 (dx) then describes an ordinary Riemannian geometry, and 
F±(dX) an ordinary pseudo-Riemannian geometry. This is the standard situation, 
and is for our current purposes physically uninteresting. 

Uni-axial: If one of the principal velocities is distinct from the other two, then 
we can without loss of generality set v x = v y = v and v z = v e . The discriminant 
then factorizes into a perfect square 

D = — m • ( 4 °) 



o e 



In this case it is immediately clear that both i^-^dx) reduce to simple quadratics, 
and so describe two ordinary Riemannian geometries. Indeed 

^ 3+ (n) = (41) 



and 



In the language of crystal optics v and v e are the "ordinary" and "extraordinary" 
ray velocities of a uni-axial birefringent crystal. In geometrical language the two 
photon polarizations "see" distinct Riemannian 3-geometries F%± (dx) and distinct 
pseudo-Riemannian 4-geometries F4±(dX) — this situation is referred to as "bi- 
metric" . This situation is for our current purposes physically uninteresting. 

Bi-axial: The full power of the Finsler approach is only needed for the bi-axial 
situation where the three principal velocities are distinct. This is the only situation 
of real physical interest for us, as it is the only situation that leads to a non-trivial 
Finsler metric. In this case we can without loss of generality orient the axes so 
that v x > v y > v z . There are now two distinct directions in the x—z plane where 
the discriminant vanishes — these are the called the (ray) optical axes. After some 
manipulations that we relegate to | Appendix C| the discriminant can be factorized 
as [cf. equation (fTTj) ] 

D = ( 1~4,? 2 X [( n ' n ) ~ ( g i ' n) 2 ] [(" ' - • n) 2 ] , (43) 

z ±v x v z 

where the two distinct (ray) optical axes are 

a* / 4)2 — . q )2 \ 

(44) 
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Note that ei j2 are unit vectors (in the ordinary Euclidean norm) so that the dis- 
criminant D vanishes for any n oc §i i2 , and does not vanish anywhere else. We can 
thus introduce projection operators P\ and P 2 and write 

Pi(n,n) = (n-n)- (ej • n) 2 ; (45) 

P 2 (n,n) = (n-n)-(e 2 -n) 2 ; (46) 
Combining this with our previous results: 

{F 3± (n)} 2 = q 2 (n,n) T J P 1 (n,n) P 2 (n,n). (47) 

If we now calculate the Finsler metric [<73±(n)]jj we shall rapidly encounter technical 
difficulties due to the discriminant term. To make this a little clearer, let us define 



[ft(n)]« = 



d 2 y Pi(n,n) P 2 (n,n) 

dn l dni 



since then 



[ff3(n)]« = fe(n)] y T ^^[A(n)]«. 
Temporarily suppressing the argument n, we have 



A brief computation now yields the rather formidable result 



'p 



Pi 



1 

+ 4 



^^•Pi \\f^+d t d 3 p 2 
o t Pi d j P 2 + d i P 2 



jl/2 



2/^ 

1 



VAP2 
[9^-Pi p 2 + d i d j p 2 A 



- ^Pi a,Pi -4^ - a«p 2 ^p 2 



1/2 



53/2 
2 J 



.... ^.p 2 + ^ gj.^ _ ^ ^ _ di p 2 d .p 2 Q. 

4VPlp2 L ^1 ^ 2 



(48) 



(49) 



(50) 



(51) 



.(52) 



From this expression it is clear that along either optical axis, (as long as the op- 
tical axes are distinct, which is automatic in any bi-axial situation), some of the 
components of [Ps]^ , and therefore some of the components of the Finsler metric 
[33±]ij = [92] ij ± (constant) x [Psjy, will be infinite. 

To see this in an invariant way, let u and w be two 3- vectors and consider 



[P 3 (n)](u,w) = u 1 wK 



(53) 
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After a brief computation: 



[P 3 (n)](u,w) = 



1 



[ A (u, w) P 2 (n, n) + P 2 (u, w) P 1 (n, n) ] 



2 v /Pi(n,n)P 2 (n,n) 



1 



Pi (n, u) P 2 (n, w) + P 2 (n, u) P (n, w) 



VP(n, n)P 2 (n, n) 



-Pi(n,u) Pi(n,w) 



A(n,n) 
Pi(n,n) 



- P 2 (n,u) P 2 (n, w) 



A(n,n) 
A(n,n) 



• (54) 



This quantity will tend to infinity as n tends to either optical axis provided: 

• The optical axes are distinct. 

(If the optical axes are coincident then p = P 2 and so P 3 degenerates to 



One recovers the [for our purposes physically uninteresting] result for a 
uni-axial crystal.) 
• One is not considering the special case u = w = n. 
(In this particular special case P3 degenerates to 



which is well-behaved on either optical axis.) 
In summary: 

• The spatial Finsler 3-metric is [<73±]i7 generically ill-behaved on either op- 
tical axis. 

• This feature will also afflict the spacetime pseudo-Finsler 4-metric [g4±] a fc 
defined by suitable derivatives of the Finsler 4- norm F±± . 

• This particular feature is annoying, but seems only to be a technical prob- 
lem to do with the specifics of crystal optics, it does not seem to us to be 
a critical obstruction the developing a space-time version of Finsler geom- 
etry. Ultimately it arises from the fact that the "null conoid" is given by 
a quartic; this leads to two topological cones that for topological reasons 
always intersect, this intersection defining the optical axes. 

• It is the technical problems associated with the (3+1) spacetime Finsler 
metric, to be discussed below, which much more deeply concern us. 

5.3. Technical problems with the co-Finsler 3-metric 

The phase discriminant 



arising from the Fresnel equation (and considerations of the phase velocity) exhibits 
features similar to those arising for the ray discriminant. There are three cases: 



[P 3 (n)](u,w) ->A(u,w) = P 2 (u,w). 



(55) 




(56) 



D = </ 2 (k,k) 2 - go (k,k) (k-k), 



(57) 



6 May 2010; fflftX-ed May 11, 2010 2:31 IJMPD 



Finsler-ijmpd-4-arXiv 



Pseudo-Finslerian spacetimes and multi-refringence 15 

Isotropic: If the crystal is isotropic, then D = 0. (This again is for our purposes 
physically uninteresting.) 

Uni-axial: If the crystal is uni-axial, then D is a perfect square 

and so the co-Finsler structures G%± are both Riemannian: 

G 3+ (k) = ^k-n; (59) 

G 3 _(k)=v 2 e (k 2 x + k 2 y ) + v 2 k 2 z . (60) 

(This situation again is for our purposes physically uninteresting.) 

Bi-axial: Only in the bi-axial case are the co-Finsler structures G^± truly Fins- 
lerian. There are now two distinct (phase) optical axes (wave-normal optical axes) 
along which the discriminant is zero, these optical axes being given by 




' v% — v 2 



6 i,2= ±1/4—4; ;W-|— | , (61) 



in terms of which the phase discriminant also factorizes 



D 



(k ■ k) - (k ■ ei ) 2 (k-k)-(k-e 2 r . (62) 



The co-Finsler norm is then (now using projection operators Pi and P 2 based on 
the phase optical axes ei j2 ) 

|G 3± (k)} 2 = g 2 (k,k) t {V * ~ V * ] A/Pi(k,k) P 2 (k,k). (63) 

The co-Finsler metric is defined in the usual way 

1 9 2 [G 3± (k) 2 ] 

[h3±] (k) = 2 WW ■ (64) 

This now has the interesting "feature" that some of its components are infinite 
when evaluated on the (phase) optical axes. That is: The co-Finsler 3-metric is 
[/i3±] u ' generically ill-behaved on either optical axis. This feature will also afflict 
the pseudo-co-Finsler 4- metric [/i 4± ] ob defined by suitable derivatives of the Finsler 
4- norm Gi±. 



5.4. Technical problems with the (3+1) spacetime interpretation 

The ( 3 + 1 ) - dimens iona l spacetime objects that are closest in spirit to Riemann's 
original suggestionpS see |Appendix A[ are the quantities 

F 4 (dX) 4 = {F i+ (dX) F^(dX)} 2 ; (65) 



6 May 2010; fflftX-ed May 11, 2010 2:31 IJMPD 



Finsler-ijmpd-4-arXiv 



16 Jozef Skdkala and Matt Visser 



and 



G 4 (fc) 4 = {G 4+ (fc) G 4 -(fc)} 2 . 
as defined in equation (IT51) and (|2"5)) . This is tantamount to taking 



(66) 



F 4 (dX) = y/F A+ {dX)Fi-{dX); 



(67) 



and 



G 4 (fc) 



v/G 4+ (fc) G 4 _(fc). 



(68) 



Now F 4 (dX) and G 4 (fc) are by construction perfectly well behaved pseudo-Finsler 
and pseudo-co-Finsler norms, with the correct homogeneity properties — and with 
the nice and concise physical interpretation that the vanishing of Fi(dX) defines a 
double-sheeted "signal cone" that includes both polarizations, while the vanishing 
of G 4 (fc) defines a double-sheeted "dispersion relation" ("mass shell") that includes 
both polarizations. (Thus F^dX) and G 4 (fc) successfully unify the "on-shell" be- 
haviour of the signal cones in a Fresnel-like manner.) These objects are also the 
closest we can get to simply and cleanly implementing Riemann's 4 -root proposal 
in a Lorentzian-signature setting. 

While this is not directly a "problem" as such, the norms F^dX) and G 4 (fc) do 
have the interesting "feature" that they pick up non-trivial complex phases: Since 
F^±(dX) 2 is always real, (positive inside the propagation cone, negative outside), 
it follows that Fi±(dX) is either pure real or pure imaginary. But then, thanks to 
the additional square root in defining F^dX), one has: 

• F4,(dX) is pure real inside both propagation cones. 

• F4(dX) is proportional to = ^77^ between the two propagation cones. 

• -F 4 (dA) is pure imaginary outside both propagation cones. 

Similar comments apply to the pseudo-co-Finsler norm G 4 (fc). 

A considerably more problematic point is this: In the usual Euclidean signature 
situation the Finsler norm is taken to be smooth everywhere except for the zero 
vector — this is usually phrased mathematically as "smooth on the slit tangent 
bundle" . What we see here is that in a Lorentzian signature situation the pseudo- 
Finsler norm cannot be smooth as one crosses the propagation cones — what was 
in Euclidean signature a feature that only arose at the zero vector of each tangent 
space has in Lorentzian signature grown to affect (and infect) all null vectors. The 
pseudo-Finsler norm is at best "smooth on the tangent bundle excluding the null 
cones" . (In a mono-refringent case the squared norm, F 4 (dX) 2 , is smooth across the 
propagation cones, but in the bi-refringent case one has to go to the fourth power 
of the norm, F 4 (dX) 4 , to get a smooth function.) 

A "no go" result: Unfortunately, when attempting to bootstrap these two 
reasonably well-behaved norms to pseudo-Finsler and pseudo-co-Finsler metrics 
one encounters additional and more significant complications. We have already seen 
that there are problems with the spatial 3-metrics [<?3±]-y( n ) and [/i3±] iJ '(k) on 
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the optical axes, problems which are inherited by the single-polarization spacetime 
(3+l)-metrics [<74±] a &(n) and [ft-4±] ah (k), again on the optical axes. 
But now the spacetime (3+l)-metrics 



19 2 [F 4 (n) 2 ] 

[94\ab(n) = -- 



2 dn a dn h ' 



and 



ld 2 [G 4 (kf 



(69) 



(70) 



2 dk a dk b 

both have (at least some) infinite components — [gi\ab{n) has infinities on the entire 
signal cone, and [h,4\ ab (k) has infinities on the entire mass shell. Since the argument 
is essentially the same for both cases, let us perform a single calculation: 



g ab = \ d a db^[Fl F 2 



lhFi]^-+d b [F 2 ]^ 



(71) 
(72) 



so that 



9ab = 



21 F + 



d a db[Fl\—+d b d b [F_ i 



+- 



daF+dbF- + d a F-d b F+ - d a F+d b F+ 



d a F-d b F- 



F, 



(73) 



That is, tidying up: 



1 

9a b — 7^ 



F% F 1 

(.9+)ahTT + (.9-)a6TT 

d a F+d b F_ + d a F_d b F+ - d a F+d b F+^- - d a F_d b F_^- 



(74) 



The problem is that this "unified" metric g ab (n) has singularities on both of the 
signal cones. The (relatively) good news is that the quantity g ab {n)n a n b = F 2 (n), 
and so on either propagation cone F — > 0, so F(n) itself has a well defined limit. 
But now let n a be the vector the Finsler metric depends on, and let w a be some 
other vector. Then 

1 



g ab (n)n a w» = -n a w h d a d b {F 2 } 
= \™ h d b [F 2 ] 



^w b d b ^F\Fl 



= r 
i 

~~ 2 



d b [F 2 + ]— +d b [F_] 



2 F+ 



{[g + ] ab n a w b ) ± — + ({g_] ah n a W b }) 



F+ 



(75) 
(76) 
(77) 
(78) 

(79) 
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The problem now is this: g + and g_ have been carefully constructed to be individ- 
ually well defined and finite (except at worst on the optical axes). But now as we 
go to propagation cone "+" we have 

g ab (n) n a w b -> ~( [g+] ab n a w b ) = oo, (80) 

and as we go to the other propagation cone "— " we have 

g ab (n) n a w b -> 1( [g^] ab n a w») ?± = oo. (81) 

So at least some components of this "unified" pseudo-Finsler metric g ab (n) are 
unavoidably singular on the propagation cones. Related (singular) phenomena have 
previously been encountered in multi-component BECs, where multiple phonon 
modes can interact to produce Finslerian propagation conesPH 

Things are just as bad if we pick u and w to be two vectors distinct from "the 
direction we are looking in" , n. In that situation 



g ab (n) u a w b 



IF F 

- g+{u,w) -pP +g-(u,w) j± 

g+(u,n) g_(w,n) + g+(w,n) g-(u,n) 
F+F- 

F_ F + 
-9+(u,n) g+{w,n) - g-(u,n) g~{w,n) 



(82) 



Again, despite the fact that both g + and g_ have been very carefully set up to 
be regular on the propagation cones (except for the known, isolated, and tractable 
problems on the optical axes), the "unified" metric g a b(n) is unavoidably singular 
there — unless, that is, you only choose to look in the nn direction. 

If we give up the condition of "intuitiveness" , so that we abandon Riemann's 
4 th -root proposal, then F4 and G4 do not just factorize into a product of powers of 
the Finsler functions for individual polarizations. We could then look for more com- 
plicated ways of building "unified" Finsler and co-Finsler structures (constrained 
mainly by giving the correct light propagation cones in the birefringent crystal) , and 
we might be able to find an appropriate Finsler geometry (which might also fulfill 
some additional reasonable physical conditions). The necessary conditions for such 
a geometry are relatively easy to formulate, but because of lack of intuitiveness, 
and corresponding lack of dire ct p hysical motivation, the physical meaning of such 
an approach is highly doubtful!^ 

In short, our "no go" result is this: There is no natural and compelling way of 
assembling the individual Finsler metrics into a "unified" whole — this despite the 
fact that it is possible to assemble the individual Finsler norms into a "unified" 
whole that accurately reflects the behaviour expected from looking at the Fresnel 
equation or the on-shell dispersion relations. 
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6. Discussion and Conclusions 

The results of this investigation have been rather mixed. On the one hand we have 
seen how birefringent crystal optics is a good exemplar for providing a clean physical 
implementation of the mathematical notion of a Finsler norm and Finsler metric. 
On the other hand we have also seen how even this rather straightforward physical 
model, with its very clear and direct physical interpretation, nevertheless leads to 
a number of technical mathematical difficulties that are of physical importance. 

• While the 3-space Finsler and co-Finsler norms corresponding to the two 
photon polarizations are always well-defined, the 3-space Finsler and co- 
Finsler metrics have undesirable properties on the optical axes — ulti- 
mately this is due to the fact that the two photon polarizations have the 
same group velocity (and same phase velocity) only on the optical axes — 
so that the structure of the indicatrix (the surface describing speed as a 
function of direction) is complicated (continuous but not differentiable) on 
the optical axes. This feature then also afflicts the 4-space pseudo-Finsler 
and pseudo-co-Finsler metrics that cam be constructed for each individual 
photon polarization. While annoying, this problematic feature in 3-space is 
at least physically isolated to the optical axes. 

• It is when one tries to "unify" the two polarization modes into a single 
structure that deeper problems arise — again the 4-space pseudo-Finsler 
and pseudo-co-Finsler norms are certainly well defined (and are extremely 
close to Riemann's original conception of what a 4 t,l -order geometry should 
look like), but now the 4-space pseudo-Finsler and pseudo-co-Finsler met- 
rics arc singular on the entire signal cone and mass shell respectively. This 
problematic feature is intimately related to the fact that we are dealing 
with pseitiio-Finslcr and pseitrfo-co-Finsler geometries — it is a "divide by 
zero" problem, associated with non-zero null vectors on the signal cone 
(and mass shell), that leads to singular values for metric components. The 
feature seems to us to be quite comprehensive and to be generic to any 
attempt at merging pseudo-Finsler metrics with multi-refringence. 

We have worked through the mathematical aspects of the Finsler space interpre- 
tation of birefringent optics in some detail because this is a situation in which we 
know and understand the basic physics in considerable detail. Because of this, any 
problematic mathematical details or physics oddities we run across cannot be as- 
cribed to unknown details of the physics — any unusual features we encounter must 
be ascribed to our choice of mathematical formalism, and if they already arise in 
this very simple physical system, then any such unusual features are likely to be 
inherited by more complicated physical situations. (In particular, since our long 
range goal is to develop insight into "quantum gravity phenomenology" , issues we 
encounter here are likely to re-occur there, possibly with even more virulence.) 
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Overall, the lessons learned form this particular model have led us to re-evaluate 
and re-assess the notion of what it means to have a Lorentzian-signature Finsler 
geometry, and in particular what it means to have a Lorentzian-signature Finsler 
metric — and to re-assess the precise nature of the relationship between birefrin- 
gence and bimetricity (or more generally multi-refringence and mufti- metricity) . 
While pseudo-Finsler spacetimes are certainly useful constructs, in birefringent sit- 
uations it does not appear possible to naturally and intuitively construct a "unified" 
pseudo-Finsler spacetime such that the pseudo-Finsler metric is null on both "signal 
cones" , but has no other zeros or singularities — it seems physically more appropri- 
ate to think of physics as taking place in a single topological manifold that carries 
a number of distinct pseudo-Finsler metrics, one for each polarization mode. 

That is, the objects F<i±(dX) individually define perfectly good pseudo-Finsler 
norms and metrics, but the natural "unified" quantity Fi(dX), while it defines a 
perfectly good pseudo-Finsler norm, leads to a seriously diseased pseudo-Finsler 
metric. This is an important point, and we are still actively developing our ideas on 
this matter. 
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Appendix A. Bernhard Riemann and Finsler geometry 

To set the stage, we point out that "Finslerian" concepts have rather long history. 
As early as 155 years ago, in Bernhard Riemann's 1854 inaugural lecture)^ he 
made some brief speculations about possible extensions of what is now known as 
Riemannian geometry: 

The next case in simplicity includes those manifolds in which the line- 
element may be expressed as the fourth root of a quartic differential expres- 
sion. The investigation of this more general kind would require no really 
different principles, but would take considerable time and throw little new 
light on the theory of space, especially as the results cannot be geometrically 
expressed. . . 

... A method entirely similar may for this purpose be applied also to the 
manifolds in which the line-element has a less simple expression, e.g., the 
fourth root of a quartic differential. In this case the line-element, generally 
speaking, is no longer reducible to the form of the square root of a sum 
of squares, and therefore the deviation from flatness in the squared line- 
element is an infinitesimal of the second order, while in those manifolds it 
was of the fourth order. 
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In more modern language, Riemann was speculating about distances being defined 
by expressions of the form 

d.s 4 = gabcd dx a dx b dx c dx d . (A.l) 

Such manifolds, and their generalizations, have now come to be called Finsler 
geometries!^ In particular, Riemann's specific suggestion 

ds = \fg abcd dx a dx b dx c dx d , (A.2) 

is now known as a 4*' l -root distance, and naturally leads to the specific sub-class 
of Finsler norms known as n. -root distances. These and more general Finsler 
spaces are well-known in the mathematics communityP^HMl but considerably less 
common within the physics community. (See for example references [T2TI231 144H481 
and [50H5TI ) Perhaps the most extensive use of pseudo-Finsler geometries has been 
within the "analogue spacetime" community 5 ^ w here Fi nsler- like structures have 
arisen in th e contex t of normal mode analyses j3SU5SH 52] m multi-component BEC 
acoustics EKQHES a nd in nematic liquid crystals!^ 



Appendix B. Basics of bi-axial bi-refringent optics 

The basic optics reference we shall use is Born and Wolf, Principles of Optics^^ 
In particular we shall focus on Chapter XV, "Optics of crystals", pages 790-818. 
See especially pages 796-798 and pages 808-811. Specific page, chapter, and section 
references below are to the 7th (expanded) edition, 1999/2003. 

Permeability fi is taken to be a scalar, permittivity £y a 3 x 3 tensor. (This is 
an excellent approximation for all known optically active media.) By going to the 
principal axes we can, without loss of generality, take ey to be diagonal 

x 



e 




y 

e 







We furthermore define "principal velocities" 

c c 



v 



v 



i^z 



(B.l) 



(B.2) 



Note (this is a tricky point that has the potential to cause confusion) that v x is 
not the velocity of light in the x direction — since e x (and so v x ) is related to the 
properties of the electric field in the x direction, the principal velocity v x is instead 
the velocity of a light wave whose electric field is pointing in the x direction. That is, 
for light waves propagating in the y-z plane, one of the polarizations will propagate 
with speed v x . 



Appendix B.l. Group velocity and ray equation 

The group velocity, v g , in the framework used by Born and Wolf, is identical to the 
"ray velocity", and is controlled by the so-called "ray equation". See (15.2.29), page 
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797. To set some conventions, n will always denote a unit vector in physical space 
— a unit with respect to the usual Euclidean norm, while n is a generic position in 
physical 3-space. In contrast, k will be reserved for a unit wave- vector in the dual 
"wave- vector space" . 

Born and Wolf exhibit the ray equation in a form equivalent (Born and Wolf 
use t where we use n) to: 

i/vl-i/vy i/vi-i/vy i/vi-i/vi u - 1 - 6) 

Here the group velocity (ray velocity) is defined by looking at the energy flux and 

v 9 - v g n. (B.4) 

We can rewrite this as 

hlv 2 r h 2 v 2 h 2 z v 2 n _ ^ 

-T^T + -o^o + ~/2^ ~o = 0. (B.5) 



This form of the ray equation encounters awkward "division by zero" problems when 
one looks along the principal axes, so it is advisable to eliminate the denominators 
by multiplying through by the common factor (v 2 — v 2 )(v 2 — v 2 )(v 2 — v 2 ), thereby 
obtaining: 

nlvl(v 2 g - v 2 y ){v 2 g -v 2 z ) + n 2 y v 2 y (v 2 - v 2 )(v 2 - v 2 x ) 

+h 2 z v 2 z (v 2 -v 2 x )(v 2 -v 2 y ) = Q. (B.6) 

It is this form of the ray equation that, (because it is much better behaved), we 
shall use as our starting point. Now this is clearly a quartic in v g , and by regrouping 
it we can write 

4 r«2 2 i -2 2 i -2 21 

v g [n x v x + n y v y + n x v z \ 

2 l"~2 2/2 2\ i ' 2 2/ 2 2\ , -2 2l 2 2\1 

-v g [n x v x (v y + v z ) + n y v y {v z + v x ) + n z v z (v x + v y )\ 

+ [v 2 x v 2 y v 2 z ] = 0. (B.7) 



Equivalently 



4 l~~2 —2 -2 , -2 -2 -2 , "2 —2 —21 

g l n xV y v z + n y v z v x + n z v x v y j 



-«9 H{v v 2 +v z 2 ) + n 2 y {v z 2 +v x 2 ) 

+h 2 z (v x 2 +v y 2 )]+l = 0. (B.8) 
Now define two quadratics (in terms of the three direction cosines hi) 

q (h, n) = [h 2 x v y 2 v z 2 + h 2 y v z 2 v x 2 + h 2 z v x 2 v y 2 ] , (B.9) 



q 2 (h, h) = l [h 2 x {v y 2 + v z 2 ) + h 2 y (v- z 2 + v x 2 ) + h 2 z (v x 2 + v- 2 )} , (B.10) 



then 



..2fA\ - 92(n,n) ± ^ 2 (n,n) 2 - 9o(n,n) 



^(n) = ^ ' ' \ \ ^J-L. (B.ll) 

9 9o(n,n) 
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But, since n is a unit vector, we could equally well rewrite this as 
2/ A n g 2 (n, n) ± Vg 2 (n, n) 2 - g (n, n) (n ■ n) 



In this form both numerator and denominator are manifestly homogeneous and 
quadratic in the components of n, so for any 3-vector n (now not necessarily of unit 
norm) we can take the further step of writing 



2,, g 2 (n, n) ± v / g 2 (n, n) 2 - g (n, n) (n ■ n) 

V ° (n) = W^) • ( 3) 

The function v g (n) so defined is homogeneous of degree zero in the components of 
n: 

"fl(«n) =v g (n) =v g (n). (B.14) 

The homogeneous degree zero property should remind one of the relevant feature 
exhibited by the Finsler metric. It is also useful to note that 

1 g 2 (n,n) =F y/q 2 (n,n) 2 - <z (n,n) (n • n) 



(n • n) 



(B.15) 



Appendix B.2. Phase velocity and Fresnel equation 

In contrast, the phase velocity, in the framework used by Born and Wolf, is controlled 
by the so-called "equation of wave normals" , also known as the "Fresnel equation" . 
See equation (15.2.24), page 796. The relevant computations are similar to, but not 
quite identical to, those for the group velocity. 

Let us consider a plane wave exp(i[k • x — ojt]) and define the phase velocity by 

v p = v p k = ^ k (B.16) 

then the Fresnel equation is equivalent (Born and Wolf use s where we use k) to 

k% k„. 



- 0. (B.17) 



y2 y2 y2 y 2 y2 y 

u p u x u p v y u p v z 

This form of the equation exhibits "division by zero" issues if you try to look along 
the principal axes, so it is for many purposes better to multiply through by the 
common factor (w 2 — vD{v 2 p — Vy){v 2 p — w 2 ) thereby obtaining the equivalent of their 
equation (15.3.1) on page 806: 

kl(v 2 p - v 2 y )(v 2 p - v 2 ) + k 2 y (v 2 p - v 2 z )(v 2 p - v 2 x ) 

+k 2 z (v 2 p -v 2 x )(v 2 p -v 2 y )=0. (B.18) 

This is clearly a quartic in v p and by regrouping it, and using k • k = 1, we can 
write 



4 2 
v p - v p 



k 2 x (v 2 y + v 2 z ) + k 2 y (v 2 z + v 2 x ) + k 2 (v 2 x + v 2 y ) 



+ 



;222,;222,;222 

K v y v z + k y v z v x + k z v x v y 



0. (B.19) 
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Let us now define two quadratics (in terms of the direction cosines fcj) 
1 
2 



<l^.k } =l\kl(vl+vt) + kl(v 2 z +vl) + kl(vl+v*) , (Li.20) 



and 



g (k,k)= k 2 x v 2 y v 2 + k 2 y v 2 z v 2 x + k 2 v 2 x v 2 y , (B.21) 



so as a function of direction the phase velocity is 

v 2 p (k) = g 2 (k, k) ± ^(k.k^-goCk.k). (B.22) 

This is very similar to the equations obtained for the ray velocity. In fact, we can 
naturally extend this formula to arbitrary wave- vector k by writing 



2^ 92(k,k)± y g2 (k,k)2- go (k,k) (kTk) 
v p (k) = ^ . (B.23) 

This expression is now homogeneous of order zero in k, so that 

v p ( K k) =v p (k) =«p(k). (B.24) 
Again, we begin to see a hint of Finsler structure emerging. 

Appendix B.3. Connecting the Finsler and co-Finsler structures 

Connecting the ray-vector n and the wave-vector k in birefringent optics is rather 
tricky — for instance, Born and Wolf provide a rather turgid discussion on page 
798 — see section 15.2.2, equations (34)-(39). The key result is 

v g (n) h, _ v p (k) h _ ^ 



Vg{n) 2 -v 2 v p (k) 2 -v 2 ' 

which ultimately can be manipulated to calculate n as a rather complicated "ex- 
plicit" function of k — albeit an expression that is so complicated that even Born 
and Wolf do not explicitly write it down. Unfortunately all the extra technical ma- 
chinery provided by Finsler notions of norm and distance do not serve to simplify 
the situation. (The fact that phase and group velocities can be used to define quite 
distinct, and in some situations completely unrelated, effective metrics has also been 
noted in the context of acoustics ISQHS1I) 

Appendix C. Optical axes 

To find the ray optical axes we (without loss of generality) take v z > v y > v Zl and 
define quantities A± (this is of course the result of considerable hindsight) by: 

2 2 
2 y 2 

V x ~ ~\ 2 A2 ' V z = 1~> 2 A2 ' (C-l) 

1 — 1 + Al 



Furthermore eliminate h y by using 



V 



h 2 = l-h 2 x -h 2 z , (C.2) 
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then 



D = q 2 - qo 
1 



2 , M , £2)] 

x [(n x A + -n z A_) 2 -(A 2 h +A 2 _)]. 



= - [(n x A_|_ + h z A-)' A - (A 



Thus the (ray) optical axes are defined by 

(n x A + ±n z A_) 2 = (A 2 + +A 2 _). 
But thanks to the Cauchy-Schwartz inequality 

(n x A + ± n z A_) 2 < (n\ + n 2 z )(A 2 + + A 2 _) < (A 2 + + A 2 _). 



(C.3) 
(C.4) 
(C.5) 



Therefore on the (ray) optical axis we must have n y = 0, and (n 2 +h 2 ) — 1. So (up 
to irrelevant overall signs) 



ei,2 



±- 



A_i 







A_ 



\JA\ + Al ' ^A 2 , + A 2 
which we can recast in terms of the principal velocities as 



ei,2 




l/v 2 z -l/vl 



or 



ei,2 



»w 2 - w 2 ^ 



;^ 



lv 2 - v 2 z 



(C.6) 



(C.7) 



(C.8) 



These are the two ray optical axes. (Compare with equation (15.3.21) on p. 811 of 
Born and Wolf.) 

A similar computation can be carried through for the phase optical axes. We 
again take v z > v y > v z , and now define 



Eliminate k y by using 



k y — 1 k x 



A . 



Then 



D 



q\ - 94 
l 

4 

x 



(C.9) 
(CIO) 
(C.ll) 

(C.12) 



(k x A + + k z A_) 2 -(A 2 + + A 2 _ 
\k x A+-k z A_) 2 -(A\ + A 2 _) 
This tells us that the discriminant factorizes, always. The discriminant vanishes if 

(^A+±fc z A_) 2 = A^ + A 2 _. (C.13) 
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But by the Cauchy-Schwartz inequality 

(k x A+ ± fc z A_) 2 < (kl + k 2 z ){A 2 + + A 2 _) < (A 2 + + A 2 _). (C.14) 

Thus on the (phase) optical axis we must have k y = and (k 2 + kl) = 1. The two 
unique directions (up to irrelevant overall sign flips) that make the discriminant 
vanish are thus 

e li2 = |± A+ ; ; , A ' ] . (C.ir.) 
\ ^A\ + A 2 _ ^A\ + A 2 

which can be rewritten as 



I y2 y2 y2 y2 

x v . n ■ v 



Gl ' 2 yyVx-V 2 ' Q \VI-V 2 Z J ( ' C ' 16 ' ) 

These are the two phase optical axes. (Compare with equation (15.3.11) on p. 810 
of Born and Wolf.) 
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